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Abstract: The first data on bottom quark production in pp collisions at 1.96 TeV have 
recently been obtained by the CDF collaboration. These data probe the region of pT ~ 0, 
providing a new invaluable input on the issue of the compatibility between next-to-leading- 
order (NLO) QCD and data. We reconsider the evaluation of the h cross section, in view 
of recent theoretical developments, and of the latest inputs on structure function fits. 
We show that the new CDF measurements are in good agreement with NLO QCD. If 
CDF preliminary data are confirmed, a long-standing discrepancy between NLO QCD 
predictions and hadron-collider data can be settled. 
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1. Introduction 

The measurement of the bottom quark production cross section in pp cohisions has provided 
for the past fifteen years one of the most significant challenges to the ability of perturbative 
QCD to accurately predict absolute rates in hadronic collisions. Measurements of the 
transverse momentum (pt) spectrum in the region pT > nib (the bottom quark mass) 
have been performed by the UAl experiment [1] at the SppS {s/S = 630 GeV) and by the 
CDF [2, 3, 4] and DO [5] experiments at the Tevatron {y/S = 1.8 TeV). Comparisons of 
Tevatron data with next-to-leading-order (NLO, i.e. 0{a^)) predictions [6, 7] have shown 
a systematic excess. The precise size of this excess depends on the input parameters used 
for the theoretical calculation, which is affected by an uncertainty of up to 50% due to the 
choice of renormalization and factorization scales (/xr, /xf) and by additional uncertainties 
due to the choice of parton distribution functions (PDFs) and of the value of the b quark 
mass. Nevertheless, the central value of the NLO prediction has typically been quoted as 
being smaller than the data by factors varying between 2 and 3. 

Aside from the radical and interesting hypothesis that physics beyond the standard 
model is at work [8], the source of this discrepancy in the context of QCD has been searched 
for in various directions. At the perturbative level, the large scale dependence at NLO is 
a symptom of large higher-order contributions. First of all it is well known that there are 
new partonic processes which appear first at 0(Q;f)(such as gluon splitting). Furthermore, 
there are large logarithmic corrections which are present at all orders of perturbation 
theory. These can arise from several sources: on one side there are logarithms of the 
ratio of the hadronic center of mass energy and the quark mass [6, 9, 10] (the so-called 
small-x effects, x ^ mb/\^). On the other, multiple gluon radiation leads at large px to 
towers of logarithms of pr/mb [H]- At the non-perturbative level, it has been noted [12] 
that the pT spectrum of b hadrons (Hb) in hadronic collisions has a large sensitivity to 
the parameterization of the b ^ Hb fragmentation function, Db{z). Even assuming the 
applicability of the factorization theorem, an assumption which at low px remains to be 
validated, it is therefore crucial to ensure that the extraction of Db{z) from e+e~ data is 
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performed in a fashion consistent with its apphcation in the context of hadronic colhsions, 
an issue often overlooked in the past. 

The quantitative analysis of small-x effects in b production has followed several types 
of approaches. Some of them [13] do not attempt to include the exact NLO results, thus 
leading to a radical departure from the QCD-improved parton model. These approaches 
generally lead to very large K factors for heavy flavoTir production, even at high pT, 
where these effects should be reduced. Large effects, strongly dependent on the chosen 
fit of unintegrated gluon densities, are also found in the small-x MC implementation of 
Jung [14]. The approach of Collins and Ellis [9] aims instead at computing the small-x 
enhanced effects that are not already included in the NLO results. In this approach one 
finds at the Tevatron corrections not larger than 20-30%. 

The resummation of the logarithms of pr/^bi with next-to- leading logarithmic accu- 
racy (NLL), and the matching with the fixed-order, exact NLO calculation for massive 
quarks, has been performed in [15] (FONLL).^ A calculation with this level of accuracy is 
also available for b production in e+e~ collisions [17], and has been used for the extraction 
of non-perturbative fragmentation functions from LEP and SLC data [18]. The equiva- 
lence of the perturbativc inputs allows one to consistently convolute these functions with 
the FONLL 6-quark spectra in hadronic collisions, leading to FONLL predictions for the 
Hi, spectrum. A comparison of these predictions with CDF data at 1.8 TcV for S^-mcson 
production in the range 6 GeV < pT < '^0 GeV has been presented in [19]. There the ratio 
between data and theory, averaged over the given pT range, was reduced to a factor of 1.7, 
compatible with the residual theoretical and experimental uncertainties. This finding is 
consistent with experimental evidence from DO (see the last paper in [5]) that the inclu- 
sive rate of jets containing b quarks - a quantity largely insensitive to the details of the 
perturbative and non-perturbative fragmentation - agrees with NLO QCD [20]. 

Non-perturbative fragmentation effects are expected to play a much reduced role in 
the prediction of the total production rate, as they only smear the pr spectrum of the 
quark. Furthermore, effects due to initial-state multiple-gluon emission average to after 
Pt integration. It follows that the measurement of b production down to ~ provides a 
crucial input for clarifying the origin of the residual discrepancy between QCD and data: an 
improved agreement would strongly support the idea that the blame rests on our incomplete 
understanding of the fragmentation phase. A residual, or increased, disagreement, would 
support the relevance of small-x effects, which are expected to grow at low pr- For this 
reason, the recent CDF release [21] of preliminary data on Hb production at y/S = 1.96 TeV 
in the domain pT > 0, \yHi,\ < 0.6 provides us with a new, crucial input. In this paper we 
therefore re-evaluate the theoretical predictions developed in [19], extending the range in 
PT down to 0, and reviewing the theoretical systematics. In addition, we compare these 
results with those obtained using the MC@NLO code [22], which merges the full NLO 
matrix elements with the complete shower evolution and hadronization performed by the 
HERWIG Monte Carlo. As discussed in detail in [22] , this comparison probes a few features 

^The matching with the massive result at low pr is essential, due to the large size of mass corrections 
up to Pt ~ 20 GeV. Lack of mass effects [16] will therefore erroneously overestimate the production rate at 
small Pt- 
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where FONLL and MC@NLO differ by effects beyond NLO: tlie evaluation of subleading 
logarithms in higher-order emissions, in particular in the case of gluon emission from the b 
quark, and the hadronization of the heavy quark, which in MC@NLO is performed through 
HERWIG's cluster model, tuned on Z° HjjX decays. 



2. Theoretical results and uncertainties 



We start by considering the integrated cross section for b quarks, reported by CDF in 
the domain px > 0, \yb\ < 1. This depends only indirectly on the fragmentation and 
on the resummation of px logarithms, so we start by quoting the results obtained with 
the standard NLO calculation. The total cross section is defined by the integral of the 
single-inclusive pj- distribution, with renormalization and factorization scales defined by 
Hr^f = £,R,FlJ'0, where //g = Pt^ + rn^- The central values of our predictions are obtained 
with $,ii^F = 1 and = 4.75 GeV. To avoid the accidental compensation between the /xp 
and the fiji dependence of the cross section occurring when the two scales are kept equal, 
we compute the scale uncertainty by varying and fj,p independently over the range 
0.5 < ^R^F < 2, with the constraint 0.5 < ^r/(,f < 2. The mass uncertainty corresponds 
to the range 4.5 GeV < mb < 5 GeV. The result, using CTEQ6M [23] as PDF, is: 

< 1) = 23.6 ^^-J 7; Mb. (2.1) 

For comparison, the full FONLL calculation and MC@NLO lead to slightly larger central 
values, 25.0 fih and 25.2 ^b respectively. This difference is due to higher-order effects 
included in these calculations, which render the rapidity distribution narrower than that 
computed at the NLO. 

We studied the PDF uncertainties using three different sets of fits with systematics: 
CTEQ6 [23], MRST [24] and Alekhin [25]. Following the prescriptions in those papers, we 
get:^ 

a^^^{\yb\ < 1, CTEQ) = 23.6 ± 2.3pdf ^ih , (2.2) 
o-^^^dyfel < 1, MRST) = 20.8 ± O.DpDp /ib , (2.3) 
(^h^^{\yb\ < 1, Alekhin, FFN) = 24.3 ± 1.3pdf fih , (2.4) 
(^b^°i.\yb\ < 1, Alekhin, VFN) = 22.4 ± 1.2poF nh . (2.5) 

The two Alekhin values refer to the choice of the fixed (FFN) and variable (VFN) fiavour 
number schemes. We note that the CTEQ prediction has an uncertainty twice as large as 
the others. This is a known fact, related to the different prescriptions used to define the 
error ranges. A similar result holds in fact for other observables [27]. Secondly, we note 
that the MRST central value is only barely consistent with the others, even within the 
error bars. 



^We verified that the prescription proposed for CTEQ's sets in [26] leads to minimal differences in the 
results. 
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Figure 1: Lower panel: variations of the FONLL 6-quark pr spectrum {yi, = 0) due to PDF 
uncertainties, normalized to the central CTEQ prediction. Upper panel: variation of the spectrum 
due to scale and mass variations, normalized to the default prediction. 



We move now to the FONLL analysis of the pT spectra. The lower panel of Fig. 1 
shows the PDF uncertainty of the 6-quark px spectrum at y^, = 0. We plot the upper and 
lower results obtained using the CTEQ6, MRST and Alekhin fits, normalized to the central 
CTEQ6M prediction. In all cases, mf, = 4.75 GeV and = 1- The spectrum confirms 
a poor consistency between MRST results and the others. Only for pT > 100 GeV the 
three bands are consistent with each other. In the region below ~ 50 GeV, the theoretical 
uncertainty is however dominated by the effects of scale and mass variation, which are 
shown in the upper panel of Fig. 1. The scale variation is obtained by varying /xr and 
as was done for the total rates. What we show is the envelope of the upper and lower 
results. The points on the curves therefore do not necessarily all correspond to the same 
scale choice, and the shape is not representative of a specific scale choice. 

We summarize these results in Fig. 2, while at the same time going from the quark to 
the hadron level. The solid curves give the error band for the pT{Hb) spectrum as predicted 
by FONLL by summing in quadrature the scale, mass and PDF uncertainties. We selected 
CTEQ6M for our central prediction, and applied a 10% PDF uncertainty in each pT bin, 
to reflect the effects shown in Fig. 1. The inclusion of non-perturbative effects related 
to the b — > Hf, fragmentation has been performed according to the framework described 



- 4 - 




5 10 15 20 25 



PT(Hj/GeV 

Figure 2: The Hh spectrum for \yHi,\ < 0.6. Hi, stands for a 6 or 5 flavoured hadron. The cross 
section has been multiphed hy B = BR{Hh J/^t fi^ii^) — 6.82 x lO"** for future convenience. 
The solid curves give the FONLL overall uncertainty band as described in the text. The central 
solid curve corresponds to the central value, namely that obtained with CTEQ6M, /i = /io, and 
TOf, — 4.75 GeV. The histograms present MC@NLO results obtained with different b hadronization 
parameters. 

in [19]. The momentum fraction z (see below) used in this approach for scaling the b 
quark momentum to the Hi, hadron one is extracted from a normalized Kartvelishvili et al. 
distribution, D(z) = {a + l){a + 2)x"(l — x) [28]. The phenomenological parameter a is 
fixed to 25.6, 29.1, 34 for mb = 4.5, 4.75 and 5 GeV respectively by tuning it to moments 
of e^e~ data (see [19]). We emphasize that these values for a depend also on other details 
and parameters of the perturbative calculation (like, for instance, the value of the strong 
coupling as), and cannot therefore simply be used with other 6-quark perturbative spectra 
without retuning them to e~'~e~ data. 

In Fig. 2 we also present the results obtained with MC@NLO, using default values 
of mass, scales and PDF,^ and for various choices of the b hadronization parameters, as 
described in [22]. The dashed histogram corresponds to the HERWIG default, whereas 
the solid (dotted) histogram has been obtained by setting PSPLT(2)=0.2 and CLSMR(2)=1 
(PSPLT(2)=0.5). The agreement with the central FONLL prediction is satisfactory in terms 

^We verified that mass, scale, and PDF uncertainties in MC@NLO are of the same size as those in 
FONLL. 
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Figure 3: Dependence of the pxiHi,) spectrum on the choice of fragmentation prescription in 
FONLL (upper panel), and on the b hadronization parameters in MCQNLO (lower panel, same 
curves as in Fig. 2). 



of shape and rate in the case of the sohd histogram; the other two choices of b hadronization 
parameters are seen to return softer pt{Hi,) spectra. 

Different choices of the b hadronization parameters in HERWIG correspond to different 
treatments of fragmentation in FONLL; the latter give additional uncertainties with respect 
to those, of purely perturbative origin, that we discussed so far. Here, we shall not study 
the fragmentation uncertainties systematically, and we shall limit ourselves to illustrate 
the typical sizes of these effects. As a result, we shall not cumulate the effect of the 
following variations with those above. The choice of the fragmentation variable for the 
b Hb transition, which at high px is irrelevant, may lead to some differences at small 
Pt, where non-factorizable effects can be significant. The upper panel of Fig. 3 shows 
the effect of three separate choices of fragmentation. The solid line is the default used 
in this work, p{Hi,) = zp(b), the three- momenta being taken in the laboratory frame 
(the curves in Fig. 2 have been obtained with this choice). The dotted line corresponds 
to pt{Hi,) = zpT(b) and yn^, = Uh- The dashed line to {E + p)^^ = z{E + p)^. For 
comparison, the lower panel of Fig. 3 presents the MC@NLO results already shown in 
Fig. 2. Different fragmentation/hadronization choices lead to different shifts in yHi,^ which 
can have an impact on the cross section within the \yHi,\ < 0.6 range when pT ^ rrib. 
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For FONLL, these shifts modify the total rate at the level of ~ 10%, while no significant 
change is observed with the MC@NLO hadronization. This is not surprising, since the 
fragmentation mechanism is strictly speaking only applicable in the large-pT region, while 
the hadronization embedded into parton shower Monte Carlos should work for any pr^s 
(and thus requires more tuning to data) . 



3. Compcirison with experimental data 

We present now the comparison of our results with data. Since the CDF measurement 
is performed using inclusive B — J/tp + X decays, with J/ijj ^ M"*^/^"; we first quote 
results for this observable. The FONLL prediction uses the B ^ J /ip + X spectra mea- 
sured by BaBar and CLEO [29], applied to all Hi, states. Wc verified that the priJ/i^) 
distribution shows little sensitivity to the input choice for the decay spectrum. On the 
other hand, MC@NLO generates the B ^ J /ip + X decays using the perturbative b ^ c 
decay spectrum. The resulting pt{J/iP) spectra are compared in Fig. 4. At variance with 
the case of Fig. 2, the J/ip spectra predicted by MC@NLO are generally harder than that 
predicted by FONLL. The best agreement is obtained with the choice of the h hadroniza- 
tion parameters that gives the worst agreement in the case of the pt{Hi,) spectrum. Since 
the B ^ J/ip fragmentation in FONLL is obtained directly from data, this points out 
that a more realistic treatment of the B decay would be needed in HERWIG. Clearly, this 
effect can be eliminated by replacing the internal HERWIG B-decsy routines with standard 
-B-decay packages. 

Defining aj/^ = a{H^ J /ip,pT{J /ip) > L25,|yj/^| < 0.6) x BR(J/^ ^ /^+/^~), 
where [30] 

BR(J/V' ^ = 5.88 X 10-2 

and, to be used below, 

BR{Hb J/ip) = 1.16 X 10-2 , 

B = BR{Hh J/ip ii-^ii') = 6.82 x lO"'' , 

CDF's result is: 

in excellent agreement with ours: 



-^M = 19-9 nb , (3.1) 

—0.2 stat+syst 



where we combined in quadrature the uncertainties from scale, mass and PDF variations. 
CDF then deconvolutes the Hb J/ip decay, quoting the following result for gh,, = 
a{Hb,\yH,\ < 0.6) X B: 



agf = 24.5 nb , (3.3) 

stat+syst 
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Figure 4: FONLL prediction for the J/tp spectrum (solid curve), for central values of mass, scales, 
PDF, and fragmentation scheme, compared to MC@NLO results (histograms). 



in good agreement with our estimate: 



FONLL 



22.9 ^''\h . 

-7.8 



(3.4) 



After dividing the Hi, rate by a factor B x 0.61 to correct for the branching ratio and for 
the different yb range,^ CDF gives a 6-quark cross section for \yh\ < 1: 



< 1) 



29.4 



h6.2 



-5.4 stat+syst 



/ib , 



(3.5) 



to be compared with our estimate (Eq. (2.1) with the additional 10% PDF uncertainty, 
rescaled to the FONLL result): 



FONLL 



(ly^l < 1) 



+12.6 , 

25.0 ub . 

-8.1 



(3.6) 



We note that the difference between the central values of data and theory increases from 
5% to 15% when going from the result closest to the direct experimental measurement (the 
J/il) cross section) to those which require more deconvolution and acceptance corrections 
(the inclusive b quark rate). This seems to indicate intrinsic differences, to be ultimately 
considered as part of the systematic error, in the modeling of the transition from the quark 



*The correction factor for the yt, range predicted by FONLL and MC@NLO is also 0.61. 
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Figure 5: CDF J/ip spectrum from B decays. The theory band represents the FONLL systematic 
uncertainties, propagated from Fig. 2. Two MC@NLO predictions are also shown (histograms), 
with the same patterns as in Fig. 3. 



to the hadrons, and vice versa. Effects of this size are consistent with what we showed in 
Fig. 3. 

We finally present in Fig. 5 our prediction for the J /ip spectrum, obtained by convo- 
luting the FONLL result with the J/ij: momentum distribution in inclusive B — > J /ip + X 
decays.^ The data lie well within the uncertainty band, and are in very good agreement 
with the central FONLL prediction. We also show the two MC@NLO predictions corre- 
sponding to the two extreme choices of the h hadronization parameters considered in this 
work; very good agreement with data is obtained for one of them in terms of shape, with 
the normalization being slightly low (still within Icr of the mass and scale uncertainties). 

We stress that both FONLL and MC@NLO are based on the NLO result of [7] (hence- 
forth referred to as NDE), and only marginally enhance the cross section predicted there, 
via some higher-order effects. The most relevant change in FONLL with respect to old pre- 
dictions lies at the non-perturbative level, i.e. in the treatment of the h — > Hi, hadronization, 
which makes use [19] of the moment-space analysis of the most up-to-date data on h frag- 
mentation in e"'"e~ annihilation. The evolution of the NLO theoretical predictions over time 

^An earlier version of this work had a curve with a slightly different slope. We correct here an accidental 
error in the treatment of the Hi, decay: in the previous version the 6-quark mass, instead of the 6-hadron 
mass (which we take equal to 5.3 GeV), was used in the boost to the Hb rest frame before its decay. 
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Figure 6: Evolution of the NLO QCD predictions over time, for a/S* = 1800 GeV. 



is shown in Fig. 6. Here we plot the original central prediction of NDE for y/S =1.8 TeV 
(symbols), obtained using NLO QCD partonic cross sections convoluted with the PDF set 
available at the time, namely DFLM260 [31]. The same calculation, performed with the 
CTEQ6M PDF set (dotted curve), shows an increase of roughly 20% in rate in the region 
Pt < 10 GeV. The effect of the inclusion of the resummation of NLL logarithms is displayed 
by the dashed curve, and is seen to be modest in the range of interest. Finally, we compare 
the original NDE prediction after convolution with the Peterson fragmentation function 
(e = 0.006, dot-dashed curve), with the FONLL curve convoluted with the fragmentation 
function extracted in [19] (solid curve). Notice that the effect of the fragmentation ob- 
tained in [19] brings about a modest decrease of the cross section (the difference between 
the dashed and solid curves), while the traditional Peterson fragmentation with e = 0.006 
has a rather pronounced effect (the difference between the symbols and the dot-dashed 
curve). Thus, the dominant change in the theoretical prediction for heavy flavour produc- 
tion from the original NDE calculation up to now appears to be the consequence of more 
precise experimental inputs to the bottom fragmentation function [18], that have shown 
that non-perturbative fragmentation effects in bottom production are much smaller than 
previously thought. 

We recall once more that, at the level of central values and neglecting the theoretical 
and experimental uncertaintites, the FONLL result [19], using CTEQ5M PDF, underesti- 
mates CDF's run IB rate [4] for px ^ Q GeV by a factor of 1.7. The main improve- 
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Figure 7: Evolution of the CDF data for exclusive production: Run IA[3], Run IB [4] and 
Run II [21]. 



ment in the comparison between Run II data and theory comes from the new CDF data, 
which tend to be lower than one would have extrapolated from the latest measurements 
at 1.8 TeV. To clarify this point, we collect in Fig. 7 the experimental results from the 
CDF measurements of the cross section in Run lA [3], in Run IB [4] and in Run II. 
The rate in the bin which in the past showed the worse discrepancy, namely the first bin 
corresponding to pt{B^) > 6 GeV, evolved from 2.7 ± 0.6 ^b (Run lA) to 3.6 ± 0.6 /ib 
(Run IB), and decreased to 2.8 it 0.4 fih in Run II. The increase in the cm. energy should 
have instead led to an increase by 10-15%. The Run II result is therefore lower than the 
extrapolation from Run IB by approximately 30%. By itself, this result alone would reduce 
the factor of 1.7 quoted in [19] to 1.2 at ^/S = 1.96 TeV. In addition, the results presented 
in this paper lead to an increase in rate relative to the calculation of [19] by approximately 
10-15%, due to the change of PDF from CTEQ5M to CTEQ6M. The combination of these 
two effects, dominated in any case by the lower normalization of the new data, leads to 
the excellent agreement observed in our work. We then conclude that the improved agree- 
ment between the Run II measurements and perturbative QCD is mostly a consequence of 
improved experimental inputs (which include up-to-date ag and PDF determinations). 
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4. Conclusions 

In summary, the recent CDF measurement of total 6-hadron production rates in pp collisions 
at VS = IM TeV is in good agreement with NLO QCD, the residual discrepancies being 
well within the uncertainties due to the choice of scales and, to a lesser extent, of mass and 
PDF. A similar conclusion is reached for the pT spectrum, where the calculation is improved 
by the inclusion of the NLL resummation of collinear logarithms. The improvement in the 
quality of the agreement between data and theory relative to previous studies is the result 
of several small effects, ranging from a better knowledge of fragmentation and structure 
functions and of a^, which constantly increased in the DIS fits over the years, to the fact that 
these data appear to lead to cross sections slightly lower than one would have extrapolated 
from the measurements at 1.8 TeV. The current large uncertainties in data and theory leave 
room for new physics. However there is no evidence now that their presence is required for 
the description of the data, and furthermore the recent results of [32] rule out the existence 
of a scalar bottom quark in the range preferred by the mechanism proposed in [8]. The 
data disfavour the presence of small-x effects of the size obtained with the approaches of 
rcfs. [13]. They are instead compatible with the estimates of [9]. We thus conclude that 
approaches to the small-x problem that can exactly include the NLO corrections (i.e. that 
can be used to perform a consistently matched calculation) should be further pursued, 
also in light of the recent progress in small-x resummation [33]. It is not unlikely that 
the consistent inclusion of small-x effects could further reduce the theoretical errors due to 
scale variation and PDF uncertainties. A final confirmation of the CDF data, as well as the 
extension of the measurements to the high-pT region (pT > 40 GeV) , where the theoretical 
uncertainty from scale variations is significantly reduced, would provide additional support 
to our conclusions. While our work has no direct impact on other anomalies reported by 
CDF in the internal structure and correlations of heavy-flavoured jets [34], we do expect 
that the improvements relative to pure parton-level calculations present in the MC@NLO 
should provide a flrmer benchmark for future studies of the global final-state stucture of 
bb events. 
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